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Abstract. Large deviation principles and related results are given for a class 
of Markov chains associated to the "leaves" in random recursive trees and pref- 
erential attachment random graphs, as well as the "cherries" in Yule trees. In 
particular, the method of proof, combining analytic and Dupuis- Ellis type path 
arguments, allows for an explicit computation of the large deviation pressure. 



1. Introduction and results 

We consider in this article large deviations and related laws of large numbers 
and central limit theorems for a class of Markov chains associated to the number 
of leaves, or nodes of degree one, in preferential attachment random graphs and 
random recursive trees, and also the number of "cherries," or pairs of leaves with a 
common parent, in Yule trees. The random graphs studied model various networks 
such as pyramid schemes, chemical polymerization, the internet, social structures, 
genealogical families, among others. In particular, the leaf and cherry counts in 
these models are of interest, and have concrete interpretations. In Subsection 11.21 
we discuss applications with these models and literature. 

Define the nondecreasing Markov chain {Z n : n > 1} starting from initial state 
Zi = fco > 0, by its one-step transitions, for n > 1, 

(1.1) Pr (Z n+1 - Z n = v\Z n ) = 

where {s„ : n > 1} is a sequence of positive numbers such that 

(1.2) s n > fco + n — 1, an d > a for 1 < a < oo 

n 

with convention 0/0 = 0. Additionally, we also consider two special sequences, 
s n = n and s n = n/2 with fco < Si, related to some applications. 

We first note, in this form, Z n can be seen to represent the "mass" of red balls in 
a Polya-Eggenberger-type urn of two colors, red and blue, not necessarily tenable, 
where at each time n, proportional to the red balls mass, a (signed) mass s n +i — s n 
of blue balls is deposited, otherwise one red ball and mass s„+i — s n — 1 of blue 
balls is added. 



Date: September 26, 2008. 
2000 Mathematics Subject Classification. 60F10;05C80. 

Key words and phrases, large deviation, central limit, preferential attachment, planar oriented, 
uniformly random trees, leaves, cherries, Yule, random Stirling permutations. 
Research supported in part by Taft Research Seminar 2006/07 and 2008/09. 
Research supported in part by Taft Research Seminar 2008/09. 
Research supported in part by NSF grant DMS-0504193. 



1 - If if U = 1 
^ if V = 



2 



WLODEK BR.YC, DAVID MINDA, AND SUNDER SETHURAMAN 



Also, when s„ = (fc + 2(n - 1) + n/3)/(l + [3) and a = (2 + /3)/(l + /?) for 
/3 > — 1, the chain has interpretation as the number of leaves in preferential at- 
tachment random graphs with weight function f(k) — k + [3. Later, in Subsection 
11.21 we also remark that s n can be taken as a random sequence with respect to the 
number of "generalized" leaves, or "buds," that is those nodes, possibly with degree 
greater than one, which however connect to only one other vertex, in preferential 
attachment graphs with random edge additions. 

In addition, when s n = an for a = 1,2, Z n is also the number of leaves in 
uniformly and planar oriented recursive trees respectively. In the latter case a = 2, 
Z n is also the number of "plateaux" in a random Stirling permutation of length 2n. 

Moreover, when s n = n/2, Z n can be seen as the number of cherries in Yule 
trees. 

As the urns add mass of the opposite color, one should expect almost sure lim- 
its for the mean behavior and Normal fluctuations. In fact, by mostly martingale, 
combinatorial and urn methods, laws of large numbers (LLN) and central limit the- 
orems (CLT) have been proved, at least in the examples mentioned above when s n 
is linear with slope a (cf. Theorem II .4p . Sec [9, 36, 40 with respect to preferential 
attachment, [28], [41] with respect to recursive trees, [34] with respect to Yule trees, 
and [33 with respect to urns when a is a positive integer. 

Characterizing the associated large deviations is a natural problem which gives 
insight into the properties of rare events, and seems less studied in urns or random 
graphs. Previous work has concentrated on analytic methods with respect to "sub- 
traction" urn models-not applicable in our general setting |24]-or extensions of the 
Dupuis- Ellis weak convergence approach (cf. [20]) to different allocation models 
than ours [ST] , [33] . We note also some exponential bounds via martingale concen- 
tration inequalities are found in the case s n is linear with slope a [15]. See also 
[H [6] [T21 [17l [19l [27] for other types of large deviations work in various random tree 
models. 

In this context, our main results are to prove a large deviation principle (LDP) 
for Z n /n with an explicitly computed "pressure," or Legendre transform of the 
associated rate function (Theorem ll.ip . This is done in two different ways for the 
important case s n is linear with a — 2. Such explicit computations are not common- 
place, and our "ODE" method is quite different from the methods in [24 where 
a quasi-linear PDE is solved, or in |44j where a finite-dimensional minimization 
problem is obtained. 

Perhaps a main feature of our work is that the method given appears robust 
and applicable in diverse, not necessarily urn settings. In particular, we show the 
LDP for Z n /n does not feel the disorder in the sequence s n , is not dependent on 
the initial value Z\, and is the same as for the chain with a regular, linear s n with 
slope a. We mention that this is a consequence of a large deviation principle for 
the path interpolation of Z\ nt \/n ( Theorem II. 3[) , perhaps of interest in itself, that 
we establish, by the Dupuis-Ellis weak convergence approach. 

In addition, aside from laws of large numbers which are trivially obtained, we 
prove a central limit theorem for Z n through complex variables arguments with 
the pressure (Theorem ll.4[) . These alternate proofs of the LLN and CLT, although 
indirect, apply when the mass additions take on some negative non- integer values 
where less is known in the literature. Moreover, the results give a "quenched" 
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LLN, CLT and LDP's with respect to "generalized" leaves or buds in a preferential 
attachment scheme with random edge additions (see Subsection 1 1.2|L 

Our technique to prove the LDP for Z n /n is to consider the recurrence relation 
for m n (X) = i?[exp{AZ„}] obtained from 

(1.3) m„ +1 (A) = (l-e A )^^ + e A m„(A). 

Dividing through by m n (A), we write 

M4) m »+i(A) = l-e A m' n (X) | ^ 

m n (X) s n /n nm n (X) 

The idea now is to take the limit on n in the above display. When the "pressure" 
A exists, it satisfies A(A) = lim n _ >00 (l/n) logm„(A). In this case, it is natural to 
suppose that the limits 

<(A) 



(1.5) A'(A) = lim 

(1.6) e A(A) = lim 



nm n (X) 
m n+ i(A) 



m„(A) 

both exist. Then, from (|1.4[) . we can write the ODE 



(1.7) e A(A) = - — — A'(A) + e A ; A(0) = 0. 

a 

One can compute the solution of this differential equation (cf. (|1.12[l ) and show it 
is unique. 

The main task is to show that the pressure and limits (|1.5|) and (|1.6j) exist. But, 
the pressure exists as a consequence of the path LDP for Z\ n t\ /n by contraction 
principle. We note, in principle, one can try to compute the pressure or the rate 
function from (|1.13p by the calculus of variations, but we found it difficult to solve 
the associated Euler equations (cf. near (|4.ip ). 

Finally, we show (|1.5|) and (|1.6p exist by extending m n (A) to the complex plane, 
and then analyzing its zeroes and analytic properties. These estimates are also 
useful for the central limit theorem arguments. 

We now mention a different approach, in the spirit of [24] , when s n is linear with 
slope a — 2 and also interestingly a = 1/2, 1, to compute the pressure from analysis 
of the generating function G(X,z) = J2n>i m n (X)z n ^ 1 . From (II. 3| . we can write 
the linear PDE 

r s dG, \ . e A — 1 dG . 

1.8 — l-e A z + — = e G. 

az a o\ 

One can solve implicitly this PDE, and locate at least hcuristically a singular point. 

Then, formally, from root test asymptotics, the pressure would be the reciprocal of 

the location of the singularity. 

The difficulty is in establishing the analyticity of the solution and identifying 

its singularity. For urn models of "subtraction type" where the mass added is an 

integer, and which satisfies a balance condition, [53] uses this program to obtain 

large deviations and the CLT. However, the cases s n is linear with slope a = 

1/2, 1, 2, and more generally the urns associated with non-integer s n are not covered 

by their arguments which seem to rely on integer additions with a certain "negative" 

structure. On the other hand, we are able to supply the needed analyticity and 
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singularity identification when s n has slopes a — 1/2, 1,2, and in this way prove 
the LDP for Z n /n (Theorem 1 in these cases. 

The plan of the paper is to state the results in Subsection ll.il discuss applications 
to random graphs in Subsection 1 1.2i give the generating function proof of Theorem 
O in Section [H prove the path LDP (Theorem [L3]) in Section give the ODE- 
method proof of Theorem l 1 . 1 1 and prove the LLN and CLT (Theorem ll.4p in Section 
IU and conclude in Sectional 

1.1. Results. We recall the setting for large deviations. A sequence {X n } of ran- 
dom variables with values in a separable complete metric space X satisfies the large 
deviation principle (LDP) with speed n and rate function / : X — * [0, oo] if I has 
compact level sets {x : 7(x) < a}, and for every Borel set U G $x, 

- inf I(x) < liminf-logPr(X„ E U) 

xG[/° n— too n 

(1.9) < limsup-logPr(X„ e U) < - inf 7(x). 

n—>oo fl x6f/ 

[Here U° is the interior of U and U is the closure of U.] 

Often the rate function is given in terms of the Legendre transform of the "pres- 
sure" when it exists. When X = R, this representation takes the form 

(1.10) I(x) = sup {Ax - log A( A)}, 
where we recall the pressure satisfies 

(1.11) A(A) := lim - \ogE[e xz "}. 

n — >oo fi 

Recall now the Markov chain Z n (jl.ip corresponding to sequence {s n } and pa- 
rameter 1 < a < oo (II. 2[) . 

Theorem 1.1. The sequence Z n /n satisfies LDP with speed n and good strictly 
convex rate function I given by (|1.10[) with pressure 

(1.12) A(A) - -logf-jEL- [ X (^Z1Y ds ) forA^O 



and A(0) = 0. 

Remark 1.2. We note, when s n — an with a = 1/2,1 and fco < si, the LDP 
is also found, with pressure given by formula (|1.12|) , using the generating function 
approach (cf. Section [5]). For a = 1/2 and integer a > 1, the integral in (|1.12[) can 
be evaluated explicitly. 

We now consider the LDP for the family of stochastic processes {X n (t) : < t < 
1} obtained by linear interpolation of the Markov chain (|1.1|) . 

v i \ 1 rr nt — \ nt\ . fco 

X n {t) := — Zi nt i_ k +1 H (Zi nt i_ fc +2 — Z\nt\-ko+i) ioT t > — 

n n n 

and X n (t) := t for < t < k$/n. The trajectories of X n (f) are non-decreasing 
Lipschitz functions with constant at most 1. 
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Theorem 1.3. As a sequence o/C([0, 1];M.) -valued random variables, X n satisfies 
the LDP with the rate function I : C([0, 1];M) — > [0,oo] given by 



(1.13) 



tp(t) log 



attp(t) 



at — ip(t) 



(l-£(t))log 



i/ </>(r) is differ entiable for almost all t, <ys(0) = 0, < ip < 1, anci t/ie integral 
converges; otherwise, I (ip) = oo. 

By the contraction principle, Theorem 11.31 implies the LDP for Z n /n with the 
rate function given by the variational expression 



(1.14) 



I{x) = mf {/(¥>): <p(0) = 0, <pQ)=x}. 



In general, optimal trajectories are not straight lines — exceptions are the LLN tra- 
jectory (f a {t) = ta/(a + l) and the extreme case ip(t) = t — but they try to stay near 
the LLN line (for which I(ip a ) = 0) to minimize cost before going to destination x 
(cf. Fig. EJ. 




Figure 1 . Thick curves are numerical solutions of the Euler equa- 
tions for (jOgl with a = 2 for x = 0.13, 2/3,0.85, 1. Dashed lines 
are straight lines from (0,0) to (1,2;). 



Lemmas for the proof of Theorem 11.11 give Normal approximation. The law of 
large numbers also follows from Theorem ll.il 



Theorem 1.4. We have 



n 



a+l 



and also 



1 



\Z n — E[Z n \ 



N(0,a 2 ) where a 3 



(l + a) 2 (2 + a) 



Remark 1.5. The conclusions of Theorem 11.41 also hold when s n — an with a 
1/2, 1, and ko < s% (cf. Subsection[ 



1.2. Applications to random graph models. With respect to random graphs, 
the Markov chain Z n , representing the number of leaves, sits in the intersection of at 
least two models, that is preferential attachment graphs with linear-type weights, 
and uniformly and planar oriented trees. Also, when s n = an for a = 1/2, Z n 
represents the count of cherries in Yule trees. 
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1.2.1. Preferential attachment graphs. Preferential attachment graphs have a long 
history dating back to Yule (cf. 35J ) . However, since the work of Barabasi- Albert 
[TJ [5] , these graphs have been of recent interest with respect to modeling of various 
"real- world" networks such as the internet (WWW) , and social and biological com- 
munities. Leaves, or nodes with degree one, in these networks of course represent 
sites with one link, or members at the "fringe." [See books [23], [15], [11] for more 
discussion.] 

The idea is to start with an initial connected graph G\ with a finite number 
of vertices, and say no self- loops (so that the vertices have well-defined degrees). 
At step 1, add another vertex, and connect it to a vertex x of G\ preferentially, 
that is with probability proportional to its weight, f{d x )/^2 yeGi f(d y ), to form a 
new graph G2. Continue in this way by adding a new vertex and connecting it 
preferentially to form Gk for k > 1. Here, the "weight" of a vertex is a function / 
of its degree d x . When / : N — * 1Z+ is increasing, already well connected vertices 
tend to become better connected, a sort of reinforcing effect. 

Our results are applicable to linear weights, f(k) = k + f3, for j3 > —1, which 
correspond to certain "power law" mean degree sequences. Namely, let Zk(n) be 
the number of vertices in G n with degree k for k > 1. It was shown, by martingale 
arguments, in [9], [36], and by embedding into branching processes, in [40] that 
Zk(n)/n — > r-fc a.s. where 

From the applications point of view, the parameter (3 can sometimes be matched 
to empirical network data where similar power-law behavior is observed. 

As the number of vertices with degree 1, or the "leaves" Zi(n), increases by one in 
the next step when a non-leaf is selected, and remains the same when a leaf is chosen, 
we see that Z\(n) corresponds to the Markov chain Z n with s n as specified below. 
Since at each step the total degree of the graph augments by two, the probability at 
step n that a vertex x 6 G n is selected is (d x +(3)/(da 1 + 2{n— l) + (n— 1 + |Gi|)/3). 
Then, s„ = (d Gl + 2(n - 1) + (n - 1 + |Gi|)/3)/(l + (3) and a = (2 + /3)/(l + f3). 
Here d Gl and |Gi| are the total degree and number of vertices in G\ respectively. 

One can also "randomize" the model by adding a random number of edges at 
each step: Let {7^} be a sequence of independent identically distributed random 
variables on N with finite mean, 7 = E^i] < 00. Then, at step n, we add a new 
vertex to the graph G n and connect it to a node selected preferentially from G n 
with 7„ edges put between them. The effect of these random edge additions is 
to randomize further the weight given the nodes in the graph; the "deterministic" 
model above is when P(7i = 1) = 1. In [3], by embedding into branching processes, 

it was shown, when £7 (71 log 71) < 00, that Z^{n)/n — > r^ where r^ is given by an 
integral formula with asymptotics r& ~ l/k 3+ @^ . See |16] for an even more general 
randomized model where also a LLN is proved. Other models with different random 
edge addition schemes are found in [TI5], [3"T] . 

In this "randomized" model, we now define the notion of "generalized leaves" 
or "buds," that is, those nodes which connect to exactly one other vertex, albeit 
possibly with several edges linking them. Leaves are those buds with exactly one 
edge connection. Similar to the leaves in the deterministic setting, at step n, the 
bud count increases by one exactly when the new vertex, a fortiori a bud, connects 
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to a non-bud in G n , and remains the same when the new vertex links to a bud in 



Then, the Markov chain Z n , representing the number of buds at step n, corre- 
sponds to si = (d Gl + \Gi |/?)/(l + (i) and for n > 2, 



which satisfies assumption (|1.2[) , by the strong law of large numbers, with a = 
(2j + [3) / (1 + fl) a.s. In particular, by our results, we obtain "quenched" LLN, CLT 
and LDP's, with probability one with respect to {7^}, for the buds in this scheme. 

1.2.2. Random recursive trees. Random recursive trees are also well-established 
models, dating to the 1960's, with applications to data sorting and searching, pyra- 
mid schemes, spread of epidemics, chemical polymerization, family trees (stemma) 
of copies of ancient manuscripts etc. Leaves in these trees correspond to "shutouts" 
with respect to pyramid schemes, nodes with small "affinity" in polymerization 
models, "terminal copies" in stemma of manuscripts etc. See [41], [42] and refer- 
ences therein (e.g. |37| etc.) for more discussion. Below, we mention connections 
with Stirling permutations. 

Similar to preferential attachment, the recursive schemes form a sequence of 
trees. One starts with a single vertex labeled 0, and then adds a vertex at step 
n > 1, labeled n, to one of the n nodes already present. When the choice is made 
uniformly, the model forms a uniformly recursive tree. However, when the choice 
is made proportional to the degree, a non-uniform or plane oriented tree is grown. 
The interpretation is that in uniform trees, at each distance from the root 0, there 
is no ordering of the labels. However, in plane-oriented trees, different orders give 
rise to distinct trees; that is, after selecting a parent node at random at step n with 
k children with a certain cyclic order of labels, there are k + 1 locations where the 
new label n can be inserted. 

Plane oriented trees are similar to preferential attachment graphs with (3 = 
(cf. Chapter 4 23J), and the number of leaves corresponds to the Markov chain 
with fc — 1; s n — 2n — 1 for n > 1, and a = 2. On the other hand, the number of 
leaves in uniformly recursive trees corresponds to the case k = 1, s n = n for n > 1, 
and a = 1. With respect to both types of recursive trees, LLN's and CLT's have 
also been proved by combinatorial, urn and martingale methods (see |41j . [28]). 
So, in this context, our results give alternate proofs of the LLN and CLT for these 
recursive trees, and also an LDP for Z n /n. We note also Theorem 1 1 . 31 yields a path 
LDP for planar oriented trees. 

We comment now on recent connections of planar oriented trees with Stirling 
permutations (cf. [29], [30]). A Stirling permutation of length In is a permutation 
of the multiset {1, 1, 2, 2, . . . , n, n} such that for each i < n the elements occurring 
between the two i's are larger than i (cf. [26 ). It turns out that each permutation 
is a distinct code for a plane oriented recursive tree with n + 1 vertices. 

Quoting from [29], consider the depth first walk which starts at the root, and 
goes first to the leftmost daughter of the root, explores that branch (recursively, 
using the same rules), returns to the root, and continues to the next daughter and 
so on. Each edge is passed twice in the walk, once in each direction. Label the edges 
in the tree according to the order in which they were added-edge j is added at step 
j and connects vertex j to an previously labeled vertex. The plane recursive tree is 



n • 




n-1 



s 
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coded by the sequence of labels passed by the depth first walk. With respect to a 
tree with n + 1 vertices, the code is of length 2n, where each of the labels 1,2, ... , n 
appears twice. Adding a new vertex means inserting the pair (n + l)(n + 1) in the 
code in one of the 2n + 1 places. 

In a Stirling permutation a\a% • • • aini the index < i < 2n is a plateau if 
di = di+i (where a2 ?l +i = 0). [29] shows that the number of leaves in a plane 
oriented tree with n + 1 vertices is the number of plateaux in a random Stirling 
permutation. See [25] for more details. 

1.2.3. Yule trees. Since Yule's influential 1924 paper [33], Yule trees, among other 
processes, have been used widely to model phylogenetic evolutionary relationships 
between species (see [5] for an interesting essay). In particular, the counts of various 
shapes and features of these trees can be studied, and matched to empirical data 
to test evolutionary hypotheses. In [34], a LLN and CLT is proved for the number 
of cherries, or pairs of leaves with a common parent, in Yule trees. Associated 
confidence intervals are computed, and some empirical data sets are examined to 
see their compatibility with "Yule tree" genealogies. Other related statistical tests 
and limit results can be found in [3 [39] . 

In the Yule tree process, one starts with a root vertex. It will split into two 
daughter nodes at step 2, each of which is equally likely to split into two children 
at step 3. At step n, one of the n leaves in the tree is chosen at random, and it then 
splits into two daughters, and so on. It is easy to see that the number of cherries at 
step n is given by the Markov chain Z n with s n = n/2 and ko = 0. Correspondingly, 
our results give different proofs of the LLN and CLT for the cherry counts, as well 
as an LDP for Z n /n. 

2. Proof of Theorem II. 11 Cases s n = an, a = 1/2, 1,2 

In this section we give a proof of the LDP in Theorem 11.11 via an analysis of a 
generating function, for the special case s n = an for a = 2, at the intersection of 
several models, and two additional values a — 1/2, 1 not covered by Theorem 11.31 

The LDP is obtained by Gartner-Ellis theorem |T8l Theorem 2.3.6] from the 
following. 

Proposition 2.1. Suppose s n — an with a — 1/2, 1, or 2, and Z\ = ko < s±. The 

limit (jl.lip exists, and is given by the smooth function 



A(A) 




for a = 1/2 
for a = 1 



(2- 1 ) = log I o 7TX i TT 1 ^r a 



2 (e 



when X ^ 0, and A(0) = 0. 



We follow analytic arguments adapted from [24] , Since < ko < Z n < n + ko — 1, 
we get < m„(A) < e (™+ fc o) A with A + = max{A, 0}. Therefore, for all complex z 
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with \z\ < e A+ , G(z, A) is well defined and satisfies (II. 8|) with the initial condition 
G(0, A) = e fe ° A . The coefficients of this PDE do not vanish in the regions 

V+ = {(>,A) : A > 0, \z\ < e~ A }, X>_ = {(z, A) : A < 0, \z\ < l}. 

For A 7^ 0, the PDE can be solved by the method of characteristics. Clearly, m n (A) 
are the coefficients in the Taylor expansion at z = of the solution, and e~ A ( A ) 
is the radius of convergence of the series that can be determined by singularity 
analysis. 

For a = 1, ko = 0, 1; using initial condition G(0, A) = e k ° x we get 
G(X,z) = eM*(« A -i)-A) fi. 



1 - exp(z(e A - 1) - A) ' 

Hence, the singularity of G as a function of z nearest to the origin is a simple pole 
at 

A 

e A - 1 

By Darboux's asymptotic method [381 Ch. 8], 

-log£;(exp(AZ„)) -> log^— - 
n A 

and the LDP follows. 

Next, consider a = 1/2. In this case, the solutions of the PDE differ depending 
on the region T>±, but are explicit so there are no difficulties in constructing their 
analytic extensions. Using the initial condition ko = 0, we have G(0, A) = 1, and 
the solution of (11.81) is 



A > 



G(X,z) 



tan ( arctan Ve A — 1 — zVe x — l) 



VT-e 



A 

A < 



tanh (arctanhVl — e A — zyl — e A ) 



Hence, the singularity of G as a function of z nearest to the origin is a simple pole 
at 



arctan yje x — 1 

= A > 



-'o 



arctanhVT — < 



VI - c 



A 

A < 



; A 

Once again, by Darboux's asymptotic method [38l Ch. 8], 

-log-B(exp(AZ„)) -> logl/zo 
n 

and the LDP follows. 

For a = 2, the method of characteristics gives the following answer. 

Lemma 2.2. Suppose tp is a function of one complex variable, analytic in a domain 
T> containing (-co,— 2). Then 

(2.2) GO, A) := (e A -l)V(z(e A -l) 2 + 2A-2e A ). 
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satisfies the PDE (|1.8p for all A ^ and \z\ small enough. Furthermore, the initial 
condition is fulfilled at A ^ if 

(2.3) ^(2A-2 e A ) = ^p^- 

Proof. The verification of the initial condition is trivial, and the verification of the 
PDE is a straightforward calculation. Denoting for conciseness ip(z, A) = (p(z(e x — 
l) 2 + 2A - 2e A ), 4> {1) {z, A) = ip'{z(e x - l) 2 + 2A - 2e A ) we verify, for A ^ and z 
such that z(e x - l) 2 + 2A - 2e A G X>, that 

d -^R = 2{e x -l)e^{z,\) + 2{e x ~lf{ze x - l)^\z,\) 

and 

= ( e A -l)V (1) (z,A). 
Equation (|1.8p now follows by a calculation. □ 



Our next goal is to show that one can find a solution of (|1.8|) which can be 
analytically extended in variable z to a large enough domain. To this end, we 
need to analyze (|2.3p with complex argument. The basic plan consists of noting 
that function f(x) = 2(x — e x ) is analytic, strictly decreasing for x > 0, strictly 
increasing for x < 0, and /(0) = —2. The derivative f'(x) vanishes only at x = 0, 
so /|[o,oo) and / | (—00,0] have continuous inverses h + : (— oo, — 2] — > [0, oo) and 
h- : (— co, —2] — > (— oo, 0], and both are analytic on (— oo, —2). 

Clearly, if we define 

e k h±(X) 

^ ¥±W - {eh±w _ 1)a , 

then <p + satisfies (|2.3p for A > 0, and t/j_ satisfies (|2.3p for A < 0. The goal is 
therefore to find the appropriate analytic extensions of the functions h± . 

2.1. Construction of an analytic extension. We need to analyze f(z) = 2(z — 
e z ). The closely related function z + e z appears in [351 page 116] but proofs are not 
included there; we give details for f(z) = 2(z ~ e z ) for completeness. 
The proof relies on the following univalence criterion. 

Lemma 2.3 ( Wolff- Warschawski-Nishiro). If g is holomorphic in a convex region 
f2 and g'(£l) C H , a half-plane with G dH , then g is one-to-one on £1. 

Proof. This is 22, Theorem 2.16 on page 47] applied to the function e l9 g(z) with 
a real constant chosen appropriately to rotate H . □ 

Lemma 2.4. f is a one-to-one mapping of the half-closed strip £ = {z : < 
< 7r} onto a slit closed half-plane: {w : 3(u>) < tt} \ (—00, —2]. The boundary 
correspondence is as follows: f maps K + ni injectively onto R + Am, and f is 
one-to-one on both [0, +oo) and (— oo,0] and maps each onto {— oo,— 2] (cf. Fig. 

W- 

Proof. We write f(z) = g((), where £ = e z , g(() = 2 (log C — C) an d l°g denotes the 
principal branch of the logarithm. The function £ = e z is a one-to-one mapping 
on the strip S. The image of the interior of E is the upper half-plane H — {£ : 
3(C) > 0}. Furthermore, the upper edge R + iri of E is mapped onto (— oo, 0) and 
the bottom edge R is carried onto (0, +oo). 
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The derivative of g(Q isg'(() = 2C/|C| 2 -2. In particular, 3( 5 '(C)) = -23(C)/|C| 2 < 
for (ei. By Lemma l2~3l with Q = H, g is one-to-one on the half-plane H. Under 
g the image of H is {w : $i(w) < 2ir} \ (—00,— 2], and on the boundary we have 
g(Mr) = R + 2-Ki and g(R + ) is the slit (-00, -2] twice covered with 5(1) = -2. 
Perhaps this latter statement is easier seen directly from /; the slit is covered twice: 
/((-oc,0])=/([0,+oc)) = (-oo,-2]. □ 




Figure 2. Composition of maps f(z) — g(e z ) for the proof of 
Lemma 12.41 

We investigate the conformal mapping /|S in more detail. The preimage of 
[— 2,oo) under /|£ is the curve 7 given by 5ftz = log^z/sinSz), < 3z < ir. 
This curve begins at the origin and becomes asymptotic to R + iri in the positive 
direction. By removing the curve 7, S is cut into two parts, £_ and S + . £_ is 
bounded by Mr, 7 and R + iri with the latter line part of E_. The region E + is 
bounded by 7 and R + . Then is a conformal mapping of £_ onto the half- 

closed strip S = {w : < 3w < 2tt} with /(R + iri) = R + 2iri, f(R~) = (-00, -2) 
and 7(7) = [—2, +00). Similarly, /|S+ is a conformal mapping of E + onto the 
lower half-plane H = {z : z € H}, where z denotes the complex conjugate of z, with 
/(M+) = (-00, -2) and /( 7 ) = [-2, +00). (See PigS) 

Both maps /|S_ and extend to conformal maps of larger regions. Because 

f(z) = f(z), / maps E+ conformally onto H. As /(R + ) = (—00,— 2), / is a 
conformal map of fl + := S + U R + U S^" onto the slit plane H U (-co, -2) Ui = 
C \ [—2, +00). Let h+ : C \ [—2, +co) — > fi+ be the inverse function for The 
conformal extension of /|S~ is more involved to describe. The fact that f(z) = f(z) 
implies / maps £_ conformally onto S. Since /(M~) = (—00, —2), / is a conformal 
map of A = £- U 1" U El onto the slit closed strip S = S U (-co, -2) U S = 
{w : \$sw\ < 27r} \ [—2, +00) with the upper (lower) edge of Ao corresponding 
to the upper (lower) edge of Sq. It is straightforward to verify that f(z + liri) = 
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FIGURE 3. Conformal maps /|S_ and /|£+. 



/(z) + 47r« for each n£Z. This functional relationship implies that / is a conformal 
map of A„ = A + 2irni onto S n = Sq + Anni for each n G Z. Hence, / is a 
conformal map of f2_ := U„ e zA„ onto the infinitely slit plane 5^ :— Unez ^« ~ 
C \ Unez (I - 2, +oo) + 47rm). Let /i_ : Soo — > be the inverse function of /|fi-. 
Because C \ [— 2,+oo) D Soo, we may regard h+ as defined on Soo, so /i± have a 
common domain. (See Fig. 3]) 

The two conformal maps h± just constructed provide analytic extensions to Soo 
of the real- valued functions h±. This allows us to define a pair of functions 

k Q h±(z) 

^ w±M : = (eh± w-i)2 

which are analytic in .Soo. 

Lemma 2.5. For each point u S (— oo, —2) i/ie power series expansion of <p± has 
radius of convergence 2 — u, (p± is analytic in the slit disk D(u, r) \ [— 2, +oo), where 
r = r(u) = y/{u + 2) 2 + 16n 2 . Furthermore, ip±(w) ~ —l/(w + 2) as w — > —2 in 

Soo ■ 

Proof. For each point u € (— oo, —2) the power series expansion of h± has radius of 
convergence |u + 2|, the distance from u to — 2 because —2 is the closest singularity 
of h±. Also, h± is analytic in the slit disk D(u,r) \ [— 2,+oo), where r — r(u) — 
wju + 2) 2 + 167r 2 is the distance from u to — 2 ± 4-7rz. 

Note that singularities of tp_ that arise from h-(w) — 2nm are located at the 
slits taken out of 6*00 and that h+(w) G 0+ cannot take values in 2niri. Thus tp± 
is also analytic in the slit disk D(u,r) \ [—2, +00). 
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-2 

-ZtS 









FIGURE 4. Conformal maps f\tt + and /|f2_ with fi + := E + U 
K+ U S7 and = IXL-ao A «- 



Substituting w = 2(z — e z ), we see that 



lim (w + 2)ip±(w) 

-2, wGSoo 



lim ( 

-2, mes, 



feo'»±(«'). 



W - 



2 lim e 



k u z 



( e h±(w) _ 1)2 
1 + z - e z 



(e z - l) 2 



□ 



Conclusion of vroof of Provosition \KT\ for a — 2. We now prove (|l.lip and iden- 
tify the limit. Using functions (p± constructed above, define 

~ /( e A -l)V + We A -l) 2 + 2A-2 e A ) if A > 

[ ' 1 ' j ' l(e A -l)VMe A -l) 2 + 2A-2e A ) if A < . 



From Lemma |2~21 we see that function G(z, A) satisfies (|1.8[) for all (z, A) S 2? + U2?_. 
By uniqueness of the PDE solution in each of the two regions, G(z, A) = G(z, A) for 
all (z, A) S 2?+ U X>_. In particular, for each fixed A ^ 0, limsup ra _ too (™„(A)) 1 / n 
is the reciprocal of the radius of convergence of the series expansion of G(z, A) at 



z = 0. The latter is (1 



\\-2 



times the radius of convergence for if 

A 



siy 



n(A)(w) at 



u = 2A — 2e 6 (— oo, —2) which by Lemma T2. 51 is 2(e — A — 1). Furthermore, the 
lemma implies that there is r/ — 77(A) > such that tp S i gn (\) (w) is analytic on the slit 
disk {w : \w — u\ < 2(e A — A— l)(l+77)}\[— 2, 00). Since after appropriate translation 
and re-scaling this slit disk is larger than the indented disk A(tt/A, 77) introduced 
in [551 (2-5)], and the second part of Lemma [2.51 gives tp±(w) ~ —l/(w + 2) as 
w — > — 2, we can apply [551 Corollary 2] to get p. lip . 

Finally, as m„(0) = 1 the convergence at A = holds trivially. □ 
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3. Proof of Theorem IL3 



We follow the method and notation of Dupuis-Ellis [20]. Although some ar- 
guments are analogous to those found in 20, Chapter 6] which considers random 
walk models with time-homogeneous continuous statistics, and [44] where a differ- 
ent model with a time singularity at t = is examined, for completeness wc give 
all relevant details as several differ, especially in the lower bound proof. 

Let X™ = Zj-k +i/n for k < j < n, and set X™ — j/n for < j < k . 
Then, noting (|l.ip . given X" , we have X" +1 = X" + v™ /n where «™ has Bernoulli 
distribution p an (j/n),x n ■ Here, 



s L „ t _ fco+1J /n fort > » 
for t < % 



Pax(A) = ~5 (A) + 5UA) for 4cK, < x < a, 

a \ a/ 

and po,a ■= S\. 

Define X n as the polygonal interpolated path connecting points (j/n,X") for 
< j ; < n. Also, for probability measures fi -C v such that d/i — fdv 1 denote 
R{p\\v) — J flogfdu, the relative entropy; set R{u\\v) = oo when fj, is not abso- 
lutely continuous with respect to v. 

Let h : C([0, 1],R) — > R be a bounded continuous function. To prove Theorem 
If .31 we need only establish Laplace principle upper and lower bounds [50] page 74] . 
The upper bounds are to show 

limsup-log£{exp[-n/i(A")]} < - inf {i (tp) + h(tp)} 
n— >oc n ' v>eCi 

for a rate function I and a closed subset of continuous functions C±. The lower 
bounds are to prove the reverse inequality 

liminf-log£{exp[-n/i(A: n )]} > - inf {lUp) + hUp)\. 
Define, for fco + 1 < j < n, noting X™ = j/n for j < fcp is deterministic, that 



W n (j,x j ,...,x ko+1 ) = logS 

n 



-nh(X") 



'^fcn+1 



X k + 1 



and 



W n := M/ n (fc o ,0) 



f 



logS 



Then, by the Markov property, for k + l<j<n — 1, 



-nW n (j,Xj,...,x kQ + 1 ) 



-nW rl (i + l.X^ +1 ,x j ,...,x kQ + 1 ) 



Vfl Vn ry. 



e -„W"( J -+l > x,+./n,^,-,^o +1 )d (0£rr(0 . /n)):c .(d u ). 

By a property of relative entropy [20] Prop, f .4.2 (a)], for ko + 1 < j < n — 1, 

= -ilog J e- nWn ^ +1 ^ + y/ n ^- x ^ PanU/nhX] (dv) 

= inf {-iZ(7||/!V n {j/ n ),x } ) + I W n (j + l,Xj + y/n,Xj, . . .,X!)j(dy)\. 
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Also, the boundary condition W n (n,x n , . . . ,Xk +i) = h(x.) holds with respect to 
the linearly interpolated path x. — x n connecting {(l/n,l/n)}o<i<k , 

The basic observation in the Dupuis-Ellis method is that W n (j,Xj, . . . ,Xk +i) 
satisfies a control problem ([20l section 3.2]) whose solution for ko < j < n — 1 is 

n — l 

V n (j, Xj , . . .,x k0+1 ) = inf E. hXj _ Xko+l {- R{vn-)\\P„ n{ i/n),x f ) + h(X n )}. 

i=j 

Here, v™(dy) = v^(dy; x^ Q , . . . , x.A is a Bernoulli distribution given x^ , ■ ■ • , £j for 
k < i < n - 1 and in the display z/f (•) = v?(dy\X% Q , . . . , X? )■ {XV- ■ < i < n} is 
the adapted path satisfying X" = l/n for < I < kg, and XJ l +1 = X" + Y t n /n for 
ko < I < n—l where Y™, conditional on (XJ l , . . . XJ! ) has distribution «"(•); X n is 
the interpolated path with respect to {X™}; and Ej tXJt ___ iXhQ+1 denotes conditional 
expectation with respect to the X n process given the values {XJ 1 = x\ : k + 1 < 
I < j} at step ko + 1 < j < n. The boundary conditions are V n (n, x ni . . . , Xk +i) = 
h(x.) and 

1 n—l 

(3.1) V n (k ,V>) = V n = inf E[- R(v?(-)\\Pa„ W n),x T ) + HX n )}. 



j=k 

In particular, by (2Ql Corollary 5.2.1], 

-nh{X") 



(3.2) W n = --logS 

n 



= V 



The goal will be to take Laplace limits using this representation. To simply later 
expressions, we will take = Si for < j < ko — 1 when ko > 1. 

3.1. Upper bound. To establish the upper bound, we first put the controls {v™} 
into continuous time paths: Let v n (dy\t) = v™(dy) for t 6 {j/n, (j + l)/n], < j < 
n — l, and v n (dy\0) — Vq . Define 

v n (A x B) = j v n {A\t)dt 
Jb 

for A C R, B C [0, 1]. Define also the piecewise constant path X n (t) — X™ for 
t G {j/n, (j + l)/n], <j < n- 1, and A n (0) = 0. Then, 

(3-3) V n = mf ^{^ 1 i?K(-|i)||p ff „ (L „ tJ/ „ ) ,x-. (t )M + M^ n )}- 

From this control representation, as |V n | = |W"| < ||/i|Joo and po-.^ is supported on 
K = {0, 1}, for each n, there is supported on if and corresponding v n (dy x 
dt) = v n (dy\t) x eft such that 

W»+e = V n + e> B[[ i2(« n 0|t)||p ffB(LntJ/n)) x« (t) )* + ^. w )'. 

«/ 

As the sets if and 

r = |<p G C([0, 1];R) : y>(0) = 0, (pi, Lipschitz, with bound 1 j 

are compact on R and C([0, 1],R) respectively, and are probability measures 
on if and {X n } C T, by Prokhorov's theorem, the distributions of (v n (dy x 
dt),X n ) have a subsequence which converges weakly to a limit distribution gov- 
erning (v, X.) where for each realization, v is a probability measure on K x [0, 1] 
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and X. £ r. More precisely, let (f2, F, P) be a probability space where v : — > 
probability measures on X x [0, 1], and X : fl — > I\ Then, [201 Lemma 3.3.1] gives 
that u is the subsequential weak limit of v n , and P-a.s. for w S f2, 

u(A x P|a>) = / v(A\t,uj)dt 
J B 

for some kernel v(dy\t, ui) on isT given t and w. 

Now, by the same proof given for [201 Theorem 5.3.5] (only [201 equation (5.12)] 
in the theorem statement differs; in our context /i there is replaced by p a (jj n ^ x?)> 

as K is compact, we have (v n ,X n ,X n ) has a subsequential limit in distribution 
to (v,X,X) [the last coordinate with respect to D([0, 1],R)]. Also, P-a.s., for all 

te[o,i], 



x(t) = 



X[0,1] 



yv(dy x ds) 



yv{dy\s)ds. 



o ■/ -R" 



In particular, P-a.s., X(t) = J K yv(dy\t). 

By Skorohod representation theorem, we can assume (u n , X n ,X n ) — > (u, X, X) 
converges a.s. In particular, 1" converges uniformly to X a.s., and it is clear that 
X n converges uniformly to X a.s. as X is continuous ( (20J, p. 154]). 

Let fi s (-) = {l-S)- 1 !^-) and v n (dy x figdt) = v n (dy\t) x (j, s dt. Then, by [13 
Lemma 1.4.3 (f)] we have, for < S < 1, that 



= (1 - S)R(v n {dy x nsdt) P an{[nti / n ),X"(t) x M*)- 



Write, 
lim inf V' 



> lim inf E 



^ n (^)\\Pa A lnt W n),X Ht ))dt + h(X n ) 



> lim inf P 



(1 - S)Riv n (dy x pftft) P aB(LnlJ/n)| x»(t) x MM + h(X.) 



> E 



R{v{-\t)\\p atMt) )dt + h{X.) 



where in the last line we use Fatou's lemma, noting lower semi-continuity of R, 
v n (dy x jjLgdt) — ► v{dy\t) x p,$dt a.s., and P an n n t\/ n ) x n (t) conver g es m distribution 
to p at x(t) f° r * ^ [M] a - s - since cr n {\nt\/n) — > at, X"(i) — > X(t) uniformly on 
[0, 1] a.s., and p ff x is continuous on {(cr, x) : S < a < 1, < x < a} (cf. Section 6.2 

EQl). 

By [201 Lemma 3.3.3(b)], 



where 
(3.4) 



R(v(-\t)\\p a t,x(t)) > L[at,X(t),J yv{dy\t) 
L(t,x,y) = sup {%- log J e 6z p t , x (dz)} 
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We note, for t > 0, L(t, x, y) diverges when x = 0, < y < 1, and x = t, < y < 1 
but is finite when < x < i and < y < 1 , and in this case evaluates to 

(3.5) L(t,x,y) = y\og(-^-\ +(l-y)l g('fc^ 

understood with the usual convention 01og(0) = 0. 
Since J yv(dy\t) — X(t), we have 

liminfV™ > e\ [ L(at. X(t), X(t))dt + h(X.) 

As L > 0, and X(-) S L, letting S J. 0, we obtain, 

liminfV™ > inf / L(at,ip(t),ip(t))dt + h(<p). 

n^oo ip£T Jq 

Taking into account (|3 - 2[) , the upper bound holds with C\ = T and 

I(tp) = f L{at,<p{t),ip{t))dt. 
Jo 

3.2. Lower bound. In the following, for typographical convenience, we write 
E(X; A) for / A XdP. Let now ip* e T be such that I(ip*) < oo, and fix a bounded, 
continuous (in the sup norm) function h : C([0, 1]; R) — ► R. In view of ()3.2j) . we 
need only show, for each e > 0, that 

(3.6) limsupF™ < /(?/;*) + /i(^*) + 8e. 

n — >oc 

Step 1. Our first goal is to replace ip* by its appropriate regularization. We use the 

trick of considering a convex combination of paths, 

Mt) = {l-0)i>*(t) + 6t 

for < 6 < 1. Since \\ipg — tp*\\oo < 20, it is clear, for small enough 9 > 0, that 
IMV'e) — M^*)! < £ - Further, since I is finite on the line y = t with slope 1, by 
convexity arguments [5U1 Lemma 1.4.3 (b)], for small enough 8 > 0, 

J(^) < 7(^*)+E. 

We therefore fix > such that /(^e) < I(ip*) + £ and ft^e) < h(ip*) + e. 
Next, following [20l p. 82], we write 

(3.7) <p K {t) = / j K (s)ds, 

Jo 

where 

/4+i/k 
ipe(s)ds 
-I K 

for t € («/ K 7 (* + 1)/ K L < i < « — 1, and J K (Q) = j k (1/k). For large enough k, we 
have 

(3.8) ^ > > 

(3.9) /i(^ K ) < h(r) + 2e 

(3.10) 7(^ K ) < J(V>*)+2 £ . 



18 



WLODEK BRYC, DAVID MINDA, AND SUNDER SETHURAMAN 



Inequality (|3.8|) is a property of tpg, and is preserved by (13. 7p . Since 

lim sup \ipe(t) ~ ¥>«(*) | = 0, 
K ^°°te[o,i] 

inequality (|3.9p follows from continuity of /i by our choice of 9. Then, as 

r\tK\/K _ 

¥>«(*) = « / ipe(s)ds -> 

J [i^J /ft 

as k | oo, and (t, a;, y) i— > L(at, x, y) (cf. (|3.4[) . (|3.5|) ) as a > 1 is bounded, uniformly 
continuous on the compact set 

Ut,x,y) : < t < 1, Qt < x < t, 6 < y < l|, 

by the dominated convergence theorem, lim K /(< / 9 K ) = I(ipe). Inequality (|3.10l) 
follows due to our choice of 9. 

We now fix k such that the above bounds hold. 

Since (|3.10p implies I(f K ) < oo, we now choose a < 6 < 1/3 such that 
(3.11) / L(at,<p K (t),ip K (t))dt < e. 



We will also need an estimate on s n . From assumptions (|1.2[) . there is an < 
n < 1/2 and fci > fco + 1 such that for n > k\, 

Tl 

(3.12) j] < < 1-n. 

Sn-ka+l 

With respect to r\ and 9, we impose additionally that 5 satisfies 

(3.13) -8log{8r]9) < e/10. 

Siep i?. We now build a sequence of controls based on <p K . Recall that we already 
set v'j(dy) — 6% for < j < fco — 1 when fco > 1. Define 

J/VnO'A*).^ for fc o < j < fci 
\ Pi,i-v?«(i/") for j > fci + 1. 



Vj(dy;x ko ,. ..,Xj) 



Note, for j > fci + 1, ^™ does not depend on auxiliary inputs Xfc , . . . ,Xj, and is in 
fact the Bernoulli distribution with success probability (p K (j/n). 

Define also Xf = Z/n for < Z < fc , and XJ +1 = X™ + Yf/n for j > fc where 

P(Y? e dy|X ", . . . , X") = Vj (dj,; X ™, . . . , Xf). 
Thus, for j > 1, Xf = (1/n) y " where i 1 ?} 

j>fei+i are independent Bernoulli 
random variables with corresponding means {<p K {j /^)}j>fci+i- 

Siep 5. We now collect some useful estimates. 

A. Since Y n = 1 (when fco = 0, recall p a $ — Si), and the increments are at most 
one, we have 1/n < X" < j/n for 1 < j < n. 

B. We have 



(3.14) lim sup 

™T°°o<j/n<i 



-£l>«(i/») 



1=0 



a.s. 
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Indeed, for large enough n, as < ip K < 1, (1/n) X^=o IV'tO'/ 71 )! + &l/ n < n 
Then, by Doob's maximal inequality, 



sup 

0<j/n<l 



i=0 



> 



2nV« 



< Cn 1 / 2 ^™ - E(XZ) 
1 " 



(3.15) 



c(£;^(Z/n)(l-p«(l/n)) j 1 < Cn- 3 / 2 . 



vi=0 



where C is a constant changing line to line. 
C. For < j < n, from (|3~8j) . it follows that 



(3.16) 



1=0 



n 



D. Let j > ki + 1. Noting 1/n < Aj 1 < j/n (cf. part A) and bounds f3"J2]) . we 
have 



77 < 1 



< 1 



— < 1, 



1 X? 

< - 1 — < 1 - 77. 



Sj-ko+l VnU/n) Sj-ko+1 o n {j/n) 



Hence, L(a n (j/n),X™,tp K (j/n)) can be well evaluated (cf. (|3.4[) . (|3.5|) ), and we 
may rewrite the relative entropy as 

R(v";\\p anUM ,x 7 ) = y K {3/n)\og[<p K (j/n)/{l - X?/v n (j/n))] 

+ (1 - M3M) log ((1 - v> K U/n))/(Xf/a n (j/n))) 
= L(<T n (j/n),X?,<p K (j/n)). 

Further, as < ip K < 1, we have 

L(a n (j/n),X?M3/n)) 

= [4> K {j/n) h>g(p K (j/n) + (1 - y> K (j/n)) log(l - <£«(?/«))] 

-0 re (i/n) log (1 - Xpa n (j/n)) - (1 - <p K (j/n)) log (X?/<r n (j/n)) 
(3.17) < 0-log77 + log( Sj _ feo+1 ) < logOVr; 2 ), 

the last inequality using (|3.12[) again. 

Step 4- We now argue (|3.6[) via representation (|3.1[) . Let 
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Since i?(w™||p (Tri ( J y n ) ^n) = for fco < j < ki, the sum in (13. 1| equals 



n-l 



E 



In *—f 

j=k 



n-l 



Vj \\P<r n U/n),XV 



= E 



E 



- L(a n (j/n),XJMj/n)) 



n-l 



" L(vn{j/n),X",tp K {j/n));, 

_l n — 1 

e\ 



- J2 L(a n (j/n),Xf,MjM); 



j=k! + l 



(3.18) = Ai+A 2 . 

Step 5. We now treat the first term A x in l[3"~Igj) , Combining (|3~T7j) with |[3TT5]) . we 
obtain, for large u, 



1 1 J 

- ^ L(a n (j/n),X?,v K (j/n)); sup |^ - - 5>«(Z/n)| > 



j=fci+i 
< log(n/?7 2 )P 



0<J<71-1 



i=0 



2nV8 



sup 

0<j/n<l 



rj ^- — ' 



i=0 



> 



2n 1 /s 



(3.19)< Clog(n/r/ 2 )n- 3/2 < 



Step 6. For the other term A2 in (|3. 18|) . we split it into two sums depending on 
when index j < Sn or j > Sn (recall 6 from (|3.1ip ): 



E 



lSn\ 



j=k!+l 



E 



1 



53 L{a n {j/n),XJ,<p K (j/n));, 



j- [Sn] 



(3.20) = B1 + S2. 

Step 7. To estimate By, we divide it further into two terms corresponding to sums 
on indices j < n 7 / 8 and n 7 / 8 < j < 5n: 
L« 7/8 J 



- 53 L(<T n (j/n),X?,<p K (j/n));. 

71 1 * J 



L5nJ 



- 53 L(a n (j/n),Xj,ip K (j/n)); 

71 < J J 



= Di+D 2 . 

The first term D±, using (|3.17p , is bounded for all large n by 
(3.21) n- 1 / 8 log(n 7 / 8 /7 ? 2 ) < e. 
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The second term D 2 , using again ()3.17|) . is bounded in absolute value by 

-<51ogr?+- l lo g( s J-feo + l/«)l 



-E 



0<j<n-l 



i=0 



2nV« 



Now, note that (|3.12|) implies for j > fci + 1 that 



< Sj-fco + l < 



(1 — ?7)n 71 7771 

Then, as < 6 < 1/3 and < 77 < 1/2, we have for large n that 

- E l lo g( s i-feo+i/")l < 2 / max ( logft— ) , log(-) Ida; 
n . f-^ /81 7o L VI - 77/ V77/ J 

< —6(5 log <5 — 2(5 log 77. 
Also, noting (|3. 16[> . we have for large n that 



-E 



< 



< 



[Snj 



1=0 



1 



E ^ 

J=[n 7/8 1 
L5nJ 

- E ^ 



gj g 

n 2n 1 /s 



1 



2n 



(9j; 



< -2 / logy da; < -4(5 log <5- 2(51og< 



i= r„7/8-| 

By combining these estimates, we have D2 is bounded by a function of (5, 77, 9 which, 
given (|3.13p . can be made small: 

(3.22) D 2 < - 10.5 log <5- 3(5 log 77 -25 log 6 < e. 

Step 8. We now estimate the second term B 2 in (|3.20[) . Note, for n > 5~ 8 , by 
(|3~Il)j) the event 

j sup \xf - l^ K {l/n)\ < JLg) C { inf X; > 6 1\. 

^ 0<j<n-l n 2n i / s J I j>5n J 2 J 

Hence, for large n, 



1=0 



Bo < E 



■ n { inf X? > - 

. j>5n 3 2 



} 



■ V L(a n (j/n),X?,p K (j/n));A c n\ h 

j= \Sn] 

Note, from assumption (|1.2[) and X" < j/n, that 5/(1 — 77) < (T n (j/n) < I/77 
and X™ < <7 n (j'/n)(l — 77) when 5 < j/n < 1 for all large n. Also, L(t,x,y) is 
continuous, and therefore also bounded and uniformly continuous on the compact 
set (cf. definition of L ([331), $SM ), 

(3.23) {(t, x, y) : ^— < t < I/77, ® < x < (1 - 77/2)7;, < y < l}. 
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Then, 



lim sup E 



(3.24) 



Further, 



(3.25) 



- L(<T n (j/n),X?,V K (3/n));A c n{ inf X? > -} 

1 ™~ 1 / 1 j 

< limsup- Y L { a n(j/n),-^2(p K (l/n),tp K (j/; 



j=\Sn\ 



1=0 



lim sup - } ip K (l/n) - / <p K (s)d. 



= 0, 



as ipk is piecewise constant and bounded. Then, given the bounds on cr n (j/n) 
above, 9j/n < (1/n) Ya=o (PkQ/n) < j/n (cf. (|3.16|) ). and uniform continuity of L 
on the compact set (|3.23|) . we may analogously bound (I3.24|) by 



^ n — l 

(3.2B)n - V L(a n (j/n),<p K (j/n),tp K (j/n)) 

n — >oo Tl — ' 



3=1 Sn] 



L{at,ip K {t),ip K (t))dt. 



Step 9. Finally, with respect to the second term in Q3.ip . by (|3.14[) and (|3.25[) . in 

the sup topology, lim^^ h(X n ) = h{ip K {-)). 

We now combine all bounds to conclude the proof of (|3.6|) . By (|3.1|) , and bounds 
(I3~ig)l . (j3~2T1) . (I3~2~2l . (I3~2^D . we have 



lim sup V n < lim sup E 



rl 



ra-1 



- £ L(a n (i/n),X; ! ^(j/n))+/ l (X") 



j=fci+i 



< 3e+ / L(at,ip K (t),tp K (t))dt + h(cpk). 
Js 

Then, by (j3~TT|) , (|3~g| . and (j3~TU]) . we obtain (j3~H|l . 



4. Proofs of Theorems 11.11 and 11.41 

We first address the proof of Theorem 11.11 and later the proof of Theorem 
11.41 in Subsection 14.31 Since Z n /n = " + ^° - 1 X n+ k g - 1 ( 1 ) , by contraction principle 
Theorem 11.31 implies Theorem 11.11 with the rate function / given in (|1.14[) . As 
x t— > \x is a bounded continuous function on [0, 1], by Varadhan's Integral Lemma 
(see QU Theorem 4.3.1], or [20l Theorem 1.3.4]), this implies that the limit (fTTT]) 
exists, and equals 

A(A) = sup{A^(l) - I{<p)} 

= sup { [\xy{t)-v{t)\o g ^L 

V : V {0)=0 L JO at ~ fW 

(4.i) H1 - miog ^m ]dt y 
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Direct derivation of formula (|1.12p or even formula (|2.1j) for a = 2 from (|4.1j) seems 
quite challenging (cf. [11]). The Euler equations are: 

Cp a 1 

cp(l — cp) at — cp cp 

cp(0) = 
vK 1 ) a-cp(l) A 

1-0(1) cp(l) 6 ■ 

Numerical evidence suggests that the solutions of the Euler equations indeed give 
the correct answer. 

4.1. Proof of Theorem 11.11 In this section we show that one can use Theorem 
11.31 to set up the differential equation (|1.7[) which implies formula (|2.ip . Recall 
notation for the moment generating function. As we already noted, Theorem 11.31 
implies that 

(4.2) -logm„(A) A(A) 

n 

with A(A) given by (|4~Tj) . 

Formula (|2.ip follows from the following additional fact. 

Proposition 4.1. A(A), as defined by (|4.2| . is differentiable and satisfies equation 

It is straightforward to verify that (|2.1[) satisfies (|1.7|l . The following argu- 
ment shows the uniqueness: Suppose Ai, A 2 are two solutions with initial condition 
A(0) = 0. If A 1 (t) = A 2 (t) for some t > 0, then they coincide for all t > 0. There- 
fore, we must have Ai(i) > A 2 (i) for all t > 0. By the mean value theorem, there 
is to > such that A[(to) - A 2 (i ) > 0. But the equation gives A[(t) - A 2 (i) = 
a eAl(t ^Z^ 2W < for all t > 0. Similarly, if Ai(t) = A 2 (<) for some t < 0, then they 
coincide for all i < 0. Therefore, we must have Ai(t) > A 2 (£) for all t < 0. By the 
mean value theorem, there is to < such that A[(to) — A 2 (<o) = Al ^^ A2 ^^ < 0. 
But the equation gives A£(*) - A£(t) = a e [Ztt > for all t < 0. 

4.2. Proof of Proposition 14. ll As mentioned in the introduction, the differential 
equation is easy to derive heuristically from (|1.4|) : the main technical difficulty is 
in justifying the convergence of various expressions. 

To this end, the key ingredient is the control of the complex zeroes of m n (z), 
based on (the proof of) [TQ1 Theorem 1] . Note that the assumptions in the following 
Proposition ^. 21 include s n satisfying (|1.2[) , and s n — an for a — 1/2, 1 with ko < s%. 

Proposition 4.2. Suppose m n (z) is defined by (II. 3p with initial condition toi(A) = 
gfc A SMC /j that {s n } satisfies si > ko, s n > maxjfco, 1} for n > 2. We also assume 
that there is no > 1 such that s no > ko when ko = 1, 2, . . . and that there is n\ > 2 
such that s ni > 1 when ko = 0. Then m n (z) ^ in the strip |3(z)| < 7T. 

Proof. We note that m„(A) = p„(e A ) for a polynomial p n {u) with non-negative 
coefficients. Then, to prove the result, we repeat the proof of [TOl Theorem 1] to 
deduce that all complex zeros of p n {u) are real, and non- negative for n > 1. [Some 
minor details differ from [10].] 



24 



WLODEK BR.YC, DAVID MINDA, AND SUNDER SETHURAMAN 



Since m' n (X) — up' n (u), (11. 3|) gives 

Pn+l{u) = p n {u)+up n (u) 

(4.3) = ^(l-u^fa-tt)-^)]. 

Note that pi{u) = u k ° and that s± > ko- We first give the proof for the case 
when s\ > fco > 1. To reach our conclusion we prove by induction the following 
statement. 

For n > 2, p n (u) is a polynomial of degree n — 1 + ko with a root 
of multiplicity fco at u = and ri — 1 > 1 strictly negative simple 
roots. 

As si > fco j polynomial P2(u) = -^u ko ((si — ko)u + ko) satisfies the inductive 
statement. Suppose for some n > 2, polynomial p n (u) is of degree n — 1 + ko, has 
n — 1 simple negative roots u\ < U2 < ■ ■ ■ < Un-i and a root of multiplicity fco at 
u — 0. Then, from the first equality in (|4.3|) . p„+i(it) has also a root of multiplicity 
fco at u = 0. Clearly, {ui,v,2, . . . ,u„_i,0} are n distinct roots of the expression 
under the derivative on the right hand side of (|4.3p . Since {lij} are simple roots, 
the function must cross the real line, so by Rolle's theorem, p n+ i(u) has n — 1 
distinct roots interlaced between the roots of p n (u). This shows that p n +\(u) has 
n + ko — 1 real roots in the interval (ui, 0]. 

To end the proof, we want to show that the last (n + fc )th root of p n+ \(u) is 
located to the left of u\, so that all negative roots of p n+ i(u) must be simple. To 
see this, we follow again [10]: The first equation in (|4.3|1 shows that p n +i(ui) and 
p' n (u\) have opposite signs, and p' n {u\) ^ as u\ is a simple root. So 7^ 0. 

Since polynomials p n+ i(u) and p n (u) have positive leading coefficients and their 
degrees differ by 1, their signs are opposite as u — > —00. Since ui is the smallest 
root, p n (u) has constant sign for zt < u\ which matches the sign of p n +i{u\). Thus 
Pn+i {u) must eventually cross the real line to the left of u± . This shows that p n+ \(u) 
has n simple strictly negative roots, and a root of multiplicity fco at u = 0, ending 
the induction proof. 

Next, suppose si = S2 = s no _i = fco > 1, but s no > fco for some no > 2. Then 
Pi(u) = p2(u) = ■ ■ ■ = p no ( u ) — uk ° an d the inductive proof goes through with 
minor modifications, starting with p no +i — ~r~ uk ° (( s n — ko)u + ko) that replaces 
P2(u) in the previous argument. 

Finally, if fc = 0, then p\(u) = 1, P2(u) — u. Choose first n\ > 2 such that 
s ni > 1 but s n = 1 for 2 < n < n\. Since in this case the value of s\ is irrelevant, 
we get P2W) = ■■■ = PnAu) = u. The induction proof proceeds with minor 
modifications, starting with p n ,+i(u) — —u((s n , — l)u + 1). □ 

Lemma 4.3. A is differentiable and ^ ^ — > A'(A). 

Proof. Recall that if /(£) is holomorphic in \C,\ < R, then for < r < R 

(4.4) max|/(C)| < f^|/(0)| + max K/(C) 
|C|<r R — r R — r \(\<R 

(cf. [H Theorem 6.31(h)]). 

By Proposition 14.21 for fixed n > 1, the function m n {z) is holomorphic and 
nonzero in the strip \^sz\ < it. Since m n (z) is a polynomial in e z with nonnegative 
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coefficients, m n (t) > for all t € R, m n (t) is increasing on R, and |m„(z)| < 
m„(|^|). 

The function m n (z) has a holomorphic logarithm L n (z) on the strip \$iz\ < tt. 
Because m n (t) > for t G R, we may assume that = log(m„(i)) for all tel. 

For each t S R, we can apply fL4|) to /(() = n,- 1 L n (t + (), \(\ < tt, with R = 2tt/3 
and r = 7r/3. This gives 

max n^lLjz)! < 31 rz" 1 i. Tl (t) I + 2n _1 max |5Ri„(C)| 

|z— t|<7r/3 |z-i|<7r/3 

= 3|n _1 log(m„(i))| + 2n~ 1 max log|m„(z)| 

|z— t|<7r/3 

< 3|n- 1 log(m n (t))|+21og(m n (|t|+27r/3)). 

Since e~™'*' < m n (t) < e™'*', {n _1 L„(z) : n > 1} is a normal family (i.e., a 
uniformly bounded family of holomorphic functions) in the disk \z — t\ < 

We now note that {n~ 1 L n (t)} converges for all real t; this holds true by (|4.2p 
when a > 1, or by Proposition 12.11 when s n = n or s n — n/2. A version of Vitalli's 
theorem, see [22l p. 9], implies that n^ 1 L n (z) — > A(z) in the strip |3z| < ir/3, 
the convergence is uniform in each disc \z — t\ < 7r/3, the limit A(z) is an analytic 
function of the argument z in that strip, and all derivatives of nT x L n converge to 
the corresponding derivatives of A. In particular, the sequence n~ 1 m' n (X) /m n (X) — 
n _1 L^(A) converges to A'(A) for all real A. □ 

Conclusion of Proof of Proposition ^ ■ 1\ By Lemma l4.3l the right hand side of (|1.4p 
converges, so the left hand side must converge too: m ^ l + ]^ > — > expi(A) for some L 
uniformly in a neighborhood of A. Since the limit of ratios implies the same limit 
for 7i-th roots, we get ^ logrn„(A) — > L(X), which identifies L(X) = A(A) as the 
pressure. From Lemma 1431 the derivative m' n (\) / (nm n (\)) — ► A'(A), so passing to 
the limit in (jl.4[) one obtains the differential equation for the pressure. □ 

4.3. Proof of Theorem 11.41 In the following, s n satisfies assumptions (ll.2j) . or 
s n = n or s n = n/2. The LLN follows from the strict convexity of / in Theorem 
Ql and A'(0) = a/(l + a). 

For the CLT, we recall [12, Proposition 2] in our context: When sup„ m„(e) 1 /™ < 
oo for some e > 0, ^ closure(U rl >i2'e n ) where Ze n is the zero set of m n (z) = 
E[exp{zZ n }], and Z n /n satisfies an LDP, then (Z n — E[Z n ])/ ^fn converges in dis- 
tribution to N(0, a 2 ) where a 2 = A"(0). 

To verify assumptions, note by Theorem 11.11 that the LDP for Z n /n holds and 
lim rwoo n^ 1 \ogm n {\) = A(A) for all A G R, and by Proposition 14. 21 that m n (z) has 
no zeroes m the strip |3f(z)| < tt for n > 1. Finally, A"(0) = a 2 /[(l + a) 2 (2 + a)} 
to finish the proof. □ 

5. Concluding remarks 
We now comment on some possible extensions. 

1. Cases a = 1/2, 1. Although we prove a LDP for Z n /n when s n is linear with 
slopes a = 1/2,1 (Remark 11.21 Proposition 12. ip . the proof of Theorem 11.11 for a 
path LDP with respect to Z^ nt j/n, especially the lower bound argument, does not 
cover these cases. The difficulty is in controlling boundary behavior as estimate 
(|3.12p is not available. It would be interesting to look further into these issues. 
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2. Higher order statistics. With respect to random graph models, one might ask 
about LDP's for the vector Z„/n = (Zi(n), . . . , Zk(n))/n where the jth component 
Zj(n) counts the number of vertices with degree j < k for k > 2. In principle, our 
method to analyze the leaves can be used to study Z„/n. Indeed, the Dupuis-Ellis 
type arguments given here for a path LDP for the leaves Z^ ( \nt\ ) / n (Theorem II. ip 
would seem to extend to the vector- valued paths Z^ nt , jn. 

However, to calculate the pressure 

1 k 
A fc (Ai, . . . , Afe) = lim -logE exp I V" A»^(n) \ 

i=l 

as in Theorem II. 31 for the leaves, the differential equation which now arises for 
in place of the ODE for A (|1.7[) . is a quasilinear PDE with k > 2 independent 
variables. These PDE's, although in principle implicitly solved by the method of 
characteristics, unfortunately do not seem to admit explicit solutions, at least to the 
extent found here with respect Z\ (n) /n, a reason why we have focused on detailed 
investigations on the leaves. It would be of interest to study better these higher 
order questions. 

Acknowledgements. We thank Professor S.R.S. Varadhan for illuminating dis- 
cussions. 
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